perf+feat: pipeline optimization, inspectr fast loading, plot UI improvements by astafan8 · Pull Request #441 · toolsforexperiments/plottr

astafan8 · 2026-04-15T11:51:33Z

Summary

Performance optimizations for the DataDict pipeline, faster inspectr database loading, plot UI improvements, LaTeX label support for pyqtgraph, and regression fixes from user testing.

Performance: DataDict pipeline (core)

copy(deep=True/False) — Use ndarray.copy() instead of deepcopy (14.8x faster). Added deep parameter following xarray convention.
is_invalid() dtype fast-path — Skip a == None for numeric dtypes, use np.isnan() directly. 44x faster on 963k complex128 arrays, cascading to 2.8x faster datadict_to_meshgrid.
structure() / _build_structure() — Internal helper skips validation for known-good callers.
validate() monotonicity — Direct min/max instead of np.unique + np.sign.
label() — Removed redundant validate() call; uses .get() with defaults for robustness.
mask_invalid() / extract() — Eliminated redundant copies.
largest_numtype() — Direct dtype check (15,000x faster).
_find_switches() — Vectorized percentile + filter (2.6x faster grid guessing).
datasets_are_equal() — Short-circuit on shape mismatch.
Node process() — Deferred structure() call; only recomputed when structure actually changes.
XYSelector — Removed cascading copy. datadict_to_meshgrid uses copy=False.
Plot splitting — dataclasses.replace instead of deepcopy for PlotItem.

Performance: Inspectr database loading

Fast SQL overview — New plottr.data.qcodes_db_overview module with single JOIN query (~14ms vs minutes for large DBs).
Records counter — Counts rows from results table (not result_counter). Falls back to shape info from run_description when results table doesn't exist.
Incremental refresh — Only loads runs newer than the last known run_id; vectorized DataFrame merge.
Lazy snapshot tree — Built on-expand, not on-load (951ms to 0.3ms per click).
Collapsed by default — Info pane starts collapsed; smooth pixel scrolling.
Loading progress overlay — Live "Loading database... (N/M datasets)" feedback.

Performance: Plot backends

metadataShape default — Both backends default to GridOption.metadataShape when QCodes shape metadata exists, skipping expensive guess_grid_from_sweep_direction.
Matplotlib replot optimization — _inSetData flag suppresses redundant _plotData calls from toolbar signals during setData().

Fixes: Pyqtgraph backend

Axis orientation — Transpose z data in setImage() to match matplotlib convention. Fixes 90 degree rotated image plots.
Grid resize — Reset row/column stretch factors when subplot count changes. Fixes plots staying small after reducing from many to few subplots.
Complex mode switching — Reset imagData flag before checking data. All complex representations (Real, Re/Im, Split Re/Im, Mag/Phase) available for 1D and 2D data.
Minimum widget size — Set 40x40 minimum on PlotBase to reduce QFont::setPointSize warnings.

Fixes: Matplotlib backend

Blank plot on first load — _inSetData flag ensures plotType is set correctly during setData().

UI: Data selector buttons

Select all — Selects all dependent variables. Single signal emission (batch).
Select first only — Selects only the first dependent (default view).
Select all 1D — Selects only 1D dependents. Only visible when 1D variables exist.
Select all 2D — Selects only 2D dependents. Only visible when 2D variables exist.

UI: Plot layout and controls

Plot backend selector — Combo box in inspectr toolbar to switch between matplotlib and pyqtgraph.
Grid layout for pyqtgraph — QGridLayout with equal stretch, matching matplotlib's near-square subplot arrangement.
Scrollable plot area — Toggle + min-height spinbox in both backends. Off by default.
Wider default window — Inspectr opens at 1400x900.
Matplotlib colormap selector — Combo box in toolbar with popular colormaps first, then all matplotlib colormaps. Changes take effect immediately.

UI: LaTeX labels in pyqtgraph

plottr.utils.latex — LaTeX-to-HTML conversion for Qt rich text (Greek letters via unicodeit, subscripts, superscripts, fractions, square roots).
Detection guard — Only triggers on actual LaTeX syntax. Plain strings pass through unchanged.

Code quality and typing

find_scale_and_prefix import — Updated to qcodes.plotting.axis_labels with fallback chain for older qcodes and when qcodes is not installed.
Mypy clean for both PyQt5 and PyQt6 — Per-module warn_unused_ignores override for modules with cross-backend type: ignore comments.
Timestamp parsing — datetime.fromisoformat instead of string slicing.

Tests (new)

test_datadict_copy_semantics.py — 64 copy/isolation tests
test_pipeline_coverage.py — 63 pipeline tests with hypothesis
test_round2_optimizations.py — 32 optimization tests
test_gridder_comprehensive.py — 62 gridder tests
test_latex.py — 38 LaTeX conversion tests
test_plotting.py — 17 new tests: axis orientation, complex splitting, mpl first-plot, pyqtgraph complex modes
test_data_selector.py — 8 new tests: selection buttons
test_qcodes_data.py — 4 new tests: records counter, dataset refresh

Documentation

PERFORMANCE_PLAN.md — Profiling analysis, benchmarks, and future optimization suggestions.

Major performance improvements to plottr's data pipeline: - Rewrite copy() with targeted per-key semantics (14.8x faster for meshgrid) - Add copy(deep=False) API for shallow copies (xarray convention) - Optimize MeshgridDataDict.validate() monotonicity check (1.5x faster) - Add _build_structure() to skip redundant validation in internal callers - Fast-path mask_invalid() to skip clean data (65,000x memory reduction) - Fix cascading copies in XYSelector (was copying twice via inheritance) - Pass copy=False in DataGridder to avoid redundant array duplication - Optimize datasets_are_equal() with shape short-circuit - Fix bug: copy() now properly deep-copies global mutable metadata Adds 127 new tests covering copy semantics, pipeline integrity, various data shapes/dtypes, and edge cases (hypothesis property-based testing). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Comprehensive analysis of remaining performance improvements across: - HDF5 loading (reads full dataset for shape metadata - critical fix) - Node.process() redundant structure() call on every update - Complex plot rendering deepcopy overhead - Signal emission overhead (7 signals per node per update) - largest_numtype() iterating every array element as Python objects - Various numpy anti-patterns (np.append in loops, unnecessary copies) - Architectural improvements (change detection, memoization) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Round 2 performance improvements: - largest_numtype(): use numpy dtype instead of iterating every element as a Python object (~15,000x faster for numeric arrays) - Node.process(): defer structure() call to only when structure changes (50x faster steady-state updates for large meshgrids) - is_invalid(): skip unnecessary np.zeros allocation for non-float arrays - guess_grid_from_sweep_direction(): convert once with np.asarray, not 4x - remove_invalid_entries(): replace O(n^2) np.append with list+concatenate Also fixes crash on inhomogeneous index arrays (pre-existing bug) - meshgrid_to_datadict/datadict_to_dataframe: ravel() instead of flatten() - _splitComplexData(): dataclasses.replace instead of deepcopy Adds 32 new tests (205 total passing). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Rename loop variable 'd' to 'diffs' to avoid shadowing the outer loop variable from 'for d in self.dependents()'. Add explicit type annotations for ndarray variables to satisfy mypy's type narrowing. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Benchmarked the full plottr pipeline (load -> DataSelector -> DataGridder -> XYSelector) on 23 QCodes datasets of varying shapes and sizes. Pipeline total: 1478 ms -> 1025 ms = 1.44x overall speedup. Largest gains on big datasets: stability_diagram 1.81x, large_3d_scan 1.65x. No regressions on any dataset. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Pipeline total: 6,550 ms -> 3,465 ms = 1.89x overall speedup on 8 large datasets (4M-point 1D, 800x800 2D, 100x100x80 3D, interrupted, multi-dep). Consistent ~2x speedup across 1D/2D shapes, ~1.7x on 3D. Loading times unchanged (QCodes SQLite I/O dominated). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

New benchmark measures both cold start (new flowchart) and steady state (persistent flowchart, simulating live monitoring refresh). Uses 5 repeats with warmup, reports median. Results on 31 datasets (23 small + 8 large): - Large datasets: cold 1.88x, steady 1.77x faster - Small datasets: cold 1.43x, steady 1.69x faster - Steady-state on small data shows up to 2.11x speedup Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Benchmarks real user actions (switch dep, swap axes, toggle subtract avg, slide dimension, toggle grid) on large datasets with per-node time breakdown. Key findings: - DataSelector: 10-17x faster (largest_numtype O(1), copy optimized) - SubtractAverage: 6-29x faster (copy 15x faster, mask_invalid skips clean) - ScaleUnits: 7-15x faster (copy 15x faster) - XYSelector: 1.5-2.3x (cascading copy removed) - DataGridder: 1.1x (dominated by actual gridding computation) Action-level: toggle_subtract_avg 9-10x, swap_xy 3.3x, switch_dep 2.3x, data_refresh 2.2x, slide_dimension 1.5-1.6x. DataGridder is now the dominant cost (58% of pipeline) and is the next frontier. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

The shape-guessing algorithm (_find_switches -> find_direction_period -> guess_grid_from_sweep_direction) was the #1 bottleneck after rounds 1-2. Optimizations: - Compute is_invalid() once instead of 3 times per call - Single np.percentile([lo, hi]) call instead of two separate sorts - Direct numpy subtraction instead of MaskedArray creation - Vectorized boolean mask instead of Python list comprehension - np.nanmean for NaN-safe sweep direction detection - Cached np.std in guess_grid_from_sweep_direction Results (800x800 = 640K pts): - _find_switches: 80ms -> 31ms (2.6x) - datadict_to_meshgrid: 175ms -> 71ms (2.5x) - Cumulative pipeline speedup vs master: 2.8-3.5x Adds 62 comprehensive gridder tests covering all GridOption paths, edge cases, various shapes, noisy axes, incomplete data. All 267 tests pass. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Benchmarked on production quantum device datasets: - QDstability (223 MB, 16 deps): cold 3.56x, steady 2.93x faster - TopogapStage2 (152 MB, 21 deps, 4D): cold 2.47x, steady 2.7x faster - QDtuning (14 MB, 16 deps): steady 2.73x faster - DataSelector node: 12-13x faster on these multi-dependent datasets Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Lazy snapshot loading in RunInfo: - Snapshot tree widget items are built only when the user expands the 'QCoDeS Snapshot' section, not on every click - Saves ~951ms per click on datasets with 5.9 MB snapshots (3,554x faster) - Info pane shows collapsed by default instead of expandAll() Incremental DB refresh: - refreshDB() now loads only new runs since last refresh using the start parameter of get_runs_from_db() - Merges incremental results into existing dataframe - First load still loads everything All 267 tests pass. No mypy errors introduced. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Add get_runs_from_db_fast() which uses load_by_id() directly per run, bypassing the O(N^2) experiments() + data_sets() enumeration. For 1496 runs: old approach takes 15+ minutes, new takes ~5 seconds. Incremental refresh loads only new runs since last known run_id. LoadDBProcess now uses get_runs_from_db_fast with start_run_id parameter for both initial load and incremental refresh. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

RunList now shows contextual overlay messages: - 'Loading database... (N/M datasets)' with live progress during load - 'Select a date on the left to browse datasets.' when idle - 'No datasets found in this database.' for empty DBs - 'No datasets match the current filter.' when star/cross filters hide all Progress is reported from the worker thread via progressUpdated signal, updated every 10 datasets for smooth display without overhead. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Check actual RunList widget state (topLevelItemCount) instead of _selected_dates to decide whether to show the hint text. This handles same-file reload, empty date selection, and filter edge cases. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Data structure and Metadata sections now collapsed by default (user expands what they need) - Set ScrollPerPixel on RunInfo tree widget so tall rows (e.g., long exception tracebacks in metadata) can be scrolled smoothly instead of jumping to the next row Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Adds a combo box in the toolbar to switch between matplotlib and pyqtgraph backends. Default is matplotlib. The selection applies to all newly opened plot windows. Existing windows keep their backend. The combo box respects the --plotWidgetClass passed via constructor (e.g., from script_pyqtgraph entrypoint). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

New module plottr/data/qcodes_db_overview.py: - get_db_overview(): single SQL JOIN query for all run metadata - Skips snapshot and run_description blobs entirely - Reads inspectr_tag directly as a column from runs table - 6x faster than load_by_id, ~1000x faster than experiments() enumeration - Intended for eventual contribution to QCoDeS Inspectr LoadDBProcess now uses SQL path by default with automatic fallback to qcodes API (get_runs_from_db_fast) if SQL fails. Also: default window size widened from 640x640 to 960x640. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Replace the single-column QSplitter with a QGridLayout that arranges subplots on a near-square grid, using the same formula as matplotlib: nrows = int(n ** 0.5 + 0.5) ncols = ceil(n / nrows) This makes pyqtgraph behave like matplotlib when plotting many dependents: plots are arranged in columns (e.g., 4 plots = 2x2, 6 = 2x3, 16 = 4x4) instead of stacking vertically. A scroll area wraps the grid so very many plots remain accessible. Each plot has a minimum height of 250px to stay readable. 280 tests pass, 0 mypy errors. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Add 'Scrollable' checkbox in both pyqtgraph and matplotlib toolbars: - Enabled by default - When many subplots exist, the plot area expands beyond the window and becomes scrollable, keeping each subplot readable - Can be unchecked to fit everything into the visible window PyQtGraph: min plot height reduced from 250px to 75px. Matplotlib: canvas wraps in QScrollArea, min height set per grid row. 280 tests pass, 0 mypy errors. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Both backends: - Scrollable is now OFF by default - Added a 'px' spinbox next to the Scrollable checkbox showing the minimum height per subplot row (default 75px pyqtgraph, 100px mpl) - Spinbox is only enabled when Scrollable is checked - Minimum value is 40px - Changing the spinbox value triggers replot 280 tests pass, 0 mypy errors. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

The inspectr backend selector passes plotWidgetClass to autoplotQcodesDataset, but the function signature was missing this parameter on the branch. Also passes it through to QCAutoPlotMainWindow. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Extract 'Select a date...' string into _SELECT_DATE_HINT constant - Replace string-magic backend detection with explicit _PLOT_BACKENDS mapping (display name -> class) - _backend_name_for_class() for reverse lookup - Unknown plotWidgetClass added to combo with its class name as label - _onBackendChanged uses the mapping instead of hardcoded imports 280 tests pass, 0 mypy errors. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- _split_timestamp(): proper datetime parsing instead of string slicing for splitting qcodes timestamp strings into date/time components. Applied to both get_ds_info() and _ds_to_info_dict(). - get_runs_from_db_fast(): removed unnecessary initialise_or_create_database_at call, use same read_only pattern as get_runs_from_db. - qcodes_db_overview: use conn_from_dbpath_or_conn from qcodes instead of raw sqlite3.connect. Remove unused get_last_run_id function. - mpl/widgets: remove dead _scrollable attribute and fix setScrollable which had identical code in both branches. 280 tests pass, 0 mypy errors. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

New module plottr/utils/latex.py using unicodeit (now a required dependency): - Greek letters: \alpha -> α, \Omega -> Ω - Math symbols: \hbar -> ℏ, \partial -> ∂, \int -> ∫ - Subscripts: V_{gate} -> Vgate (HTML for text, Unicode for digits) - Superscripts: x^{2} -> x² (Unicode), e^{iπ} -> eiπ - Fractions: \frac{dI}{dV} -> dI/dV - Square root: \sqrt{x} -> √x - Dollar delimiters stripped Applied to pyqtgraph axis labels in FigureMaker.formatSubPlot(). Falls through gracefully on plain text (no LaTeX = no change). 35 new tests including hypothesis property-based testing. 315 tests pass, 0 mypy errors. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Convert subscripts and superscripts to HTML tags BEFORE running unicodeit, so they become 11 and 2 instead of Unicode ₁₁ and ². HTML tags render more consistently in Qt rich text. unicodeit still converts Greek letters and symbols inside the tags. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Plain strings with underscores (e.g. gate_voltage, channel_1_amplitude) now pass through unchanged. Conversion only triggers when the string contains backslash commands (\alpha), dollar delimiters ($...$), or braced sub/superscripts (_{...}, ^{...}). Also drops single-char bare sub/sup patterns (_x, ^x) which were too aggressive on ordinary identifiers. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add qcodes.utils.plotting to mypy ignore_missing_imports (module removed in newer qcodes) - Add hypothesis to test_requirements.txt for CI - Fix rettype initialization in combine_datadicts (type narrowing) - Fix plotOptions unpacking when None (mpl autoplot) - Fix widgetConnection.get type mismatch (autonode) - Remove stale type: ignore comments (inspectr) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Restore type: ignore[has-type] on inspectr starAction/crossAction (needed by PyQt5-stubs, unused with PyQt6) - Add type: ignore[arg-type] on autonode widgetConnection.get (needed with PyQt6, unused with PyQt5-stubs) - Set warn_unused_ignores = false since project targets both Qt bindings Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

qcodes moved find_scale_and_prefix from qcodes.utils.plotting to qcodes.plotting.axis_labels. Update the import chain to try the current location first, then the old one, then the local fallback. Remove qcodes.utils.plotting from mypy ignore_missing_imports since the import is now handled via try/except with proper type: ignore annotations. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Revert rettype initialization to None + assert before use (cleaner than pre-computing a default that changes the original semantics) - Remove dead local fallback for qcodes < 0.21 (min supported is 0.54.1); keep old-path fallback with version comment for < 0.46 - Restore warn_unused_ignores = true globally; add per-module override (warn_unused_ignores = false) only for modules with cross-Qt-backend type: ignore comments (inspectr, autonode, scaleunits) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

This PR focuses on speeding up Plottr’s data pipeline and inspectr DB browsing, while improving plot UI/UX across backends.

Changes:

Optimizes core DataDict/numeric utilities and node processing to reduce copying/validation overhead.
Adds fast inspectr DB loading (direct SQL overview + incremental refresh) and lazy snapshot rendering.
Improves plotting UX: pyqtgraph subplot grid layout, scrollable plot areas, LaTeX-ish label rendering, and backend selection.

Reviewed changes

Copilot reviewed 24 out of 24 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
test_requirements.txt	Adds `hypothesis` for new property-based tests.
pyproject.toml	Adds `unicodeit` dependency and mypy overrides.
plottr/utils/num.py	Performance-oriented rewrites for numeric helpers and grid guessing.
plottr/utils/latex.py	New LaTeX-to-HTML label conversion helper (uses `unicodeit`).
plottr/plot/pyqtgraph/autoplot.py	Grid-based subplot layout, scrollable plot area, LaTeX label rendering, new toolbar options.
plottr/plot/mpl/widgets.py	Wraps MPL canvas in a scroll area.
plottr/plot/mpl/autoplot.py	Makes plotting robust to `None` plot options; adds scrollable plot controls.
plottr/plot/base.py	Avoids `deepcopy` for complex splitting via `dataclasses.replace`.
plottr/node/scaleunits.py	Updates `find_scale_and_prefix` import path/fallback logic.
plottr/node/node.py	Defers structure snapshot creation to only when structure changes.
plottr/node/grid.py	Avoids extra copies during gridding (`copy=False` into meshgrid conversion).
plottr/node/dim_reducer.py	Removes redundant copy in `XYSelector.process()`.
plottr/node/autonode.py	Adds a mypy-targeted `type: ignore` for option type handling.
plottr/data/qcodes_db_overview.py	New module: fast run overview via a single SQL JOIN query.
plottr/data/qcodes_dataset.py	Timestamp parsing improvements + incremental/fast DB enumeration helpers.
plottr/data/datadict.py	Major copy/structure/masking/validation performance improvements and bug fixes.
plottr/apps/inspectr.py	Fast loading, incremental refresh, progress/overlay UX, lazy snapshot tree, plot backend selector.
plottr/apps/autoplot.py	Allows injecting plot widget backend into autoplot windows.
test/pytest/test_round2_optimizations.py	New tests for optimized utilities and behaviors.
test/pytest/test_pipeline_coverage.py	New broad pipeline/node coverage incl. hypothesis strategies.
test/pytest/test_latex.py	New tests for LaTeX-to-HTML conversion.
test/pytest/test_gridder_comprehensive.py	Comprehensive gridder/gridding path tests.
test/pytest/test_datadict_copy_semantics.py	Extensive copy semantics + integrity regression tests.
PERFORMANCE_PLAN.md	Benchmarks, rationale, and future optimization plan.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

- Restore local fallback for find_scale_and_prefix when qcodes is not installed (qcodes is an optional dependency) - Move context menu setup from resizeEvent to __init__ to avoid accumulating signal connections on every resize - Replace per-row concat loop in DBLoaded with vectorized update() + single concat for new rows - Clear grid layout in _arrangeGrid before re-adding to avoid stale layout items on repeated calls - Fix setScrollable to use _minPlotHeight instead of hard-coded 75 - Remove unused deepcopy import from test Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Profiled plottr pipeline with actual measurement data ([redacted] downloaded via qdwsdk. Key findings: - is_invalid() is 44x slower than needed (a==None on numeric arrays) - ds_to_datadict takes ~1s steady state (qcodes deserialization) - datadict_to_meshgrid 122ms (avoidable when shape metadata exists) - pyqtgraph eq() adds 24ms per pipeline trigger - mag+phase complex splitting 31ms (inherent cost) Added prioritized improvement suggestions to PERFORMANCE_PLAN.md. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

is_invalid(): For numeric dtypes (float, complex, int), skip the expensive a==None comparison (always False for numeric arrays) and use np.isnan() directly. 44x faster on 963k complex128 arrays (44.6ms -> 1.0ms), cascading to 2.8x faster datadict_to_meshgrid. metadataShape: Move the qcodes_shape check from QCAutoPlotMainWindow into the parent AutoPlotMainWindow.setDefaults(), so both mpl and pyqtgraph backends benefit. When shape metadata exists, skip the expensive guess_grid_from_sweep_direction entirely. Remove now-unused packaging.version import from autoplot.py. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…date is_invalid(): Skip a==None for numeric dtypes (always False), use np.isnan directly. 44.6ms -> 1.0ms on 963k complex128 arrays. Cascades to 2.8x faster datadict_to_meshgrid (122ms -> 44ms). mpl double-replot: setData() was triggering _plotData() twice — once via setAllowedPlotTypes signal and once explicitly. Block toolbar signals during option updates to eliminate the redundant plot cycle. Saves ~500ms per setData on 963x1001 data. label(): Remove validate() call — label() only needs field lookup, not full monotonicity/shape validation. Saves ~33ms per label on large MeshgridDataDicts. metadataShape default: Move qcodes_shape check to parent class so both mpl and pyqtgraph benefit. Remove dead version import. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Mark is_invalid, metadataShape, and mpl double-replot as done. Add backend comparison table (mpl vs pyqtgraph after optimizations). Add artist-level mpl updates as future improvement suggestion. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…alidate Records counter: Count rows from the results table instead of using result_counter (which counts INSERT calls, not data points for array paramtype). Falls back to result_counter when results table doesn't exist (e.g., qdwsdk downloads). is_invalid(): Skip a==None for numeric dtypes — 44x faster on complex arrays, cascading to 2.8x faster datadict_to_meshgrid. mpl double-replot: Block toolbar signals during setData() option updates to eliminate redundant _plotData() call (~20% faster replot). label(): Remove validate() call — label only needs field lookup. metadataShape: Default to metadataShape when qcodes_shape exists, for both backends. Remove obsolete qcodes version check. Add 12 regression tests covering axis orientation, records counter, dataset refresh (incremental + inspector-level), and 1D complex data splitting. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

pyqtgraph imagData: Reset to False before checking data, so switching from complex to non-complex data correctly disables complex options. Grid layout: Add equal row/column stretch factors so all grid cells get equal space, preventing elongated plots when many subplots are arranged in the grid. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Records counter: Count rows from results table instead of using result_counter (which counts INSERT calls, not data points for array paramtype). Falls back to result_counter when table doesn't exist. Complex mode: Reset imagData flag before checking data so switching from complex to non-complex correctly updates toolbar options. Aspect ratio: Add equal row/column stretch factors to pyqtgraph grid layout so all cells get equal space. label(): Use .get() with defaults to handle DataDictBase instances that haven't been validated (no 'label'/'unit' keys yet). Select All / Deselect / 1D / 2D buttons: New buttons in the data selector widget. Batch selection (blockSignals) ensures a single signal emission and single replot per button click. 1D/2D buttons are only visible when the dataset has dependents with that many axes. 21 regression tests covering axis orientation, records counter, dataset refresh, 1D complex splitting, and selection buttons. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Recursion fix: Separate setSelectedData (original, per-item signal) from setBatchSelectedData (new, single signal for batch buttons). The original path through node signalOption/setOption relies on _emitGuiChange flag which the batch emit bypassed. Records counter: Read run_description to extract shapes, compute record count as product of shape dimensions. Falls back chain: results table rows -> shape from run_description -> result_counter. Button layout: Use addLayout instead of addItem to keep buttons inside the NodeWidget's VBoxLayout (fixes separate window issue). label(): Use .get() with defaults for missing label/unit keys. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Rewrite records counter tests to generate data in tmp_path instead of referencing local fixture files. Remove device identifiers and dataset GUIDs from PERFORMANCE_PLAN.md. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

mpl blank plot: Replace blockSignals approach with _inSetData flag that suppresses redundant _plotData calls from toolbar signals during setData(). Simpler and doesn't interfere with signal delivery. pyqtgraph grid resize: Reset all row/column stretch factors and minimum height before re-arranging the grid. Fixes plots staying small after reducing from many subplots to few. Reorganize tests: Move tests from test_regressions.py into their proper homes: - Axis orientation, complex splitting, mpl first-plot -> test_plotting - Selection buttons -> test_data_selector - Records counter, dataset refresh -> test_qcodes_data Delete test_regressions.py. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Both backends: setData(None) now clears existing plots instead of silently returning. Deselect-all produces None from DataSelector, which now correctly empties the plot area. pyqtgraph PlotBase: Set minimum size 40x40 to prevent QFont point size <= 0 warnings when pyqtgraph computes tick labels on zero-sized widgets. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Matplotlib backend: Add colormap combo box to the toolbar. Lists popular colormaps first (viridis, magma, inferno, etc.), then all others. Changing the colormap updates matplotlib rcParams and triggers an immediate replot. Add 5 tests for pyqtgraph complex mode switching: imagData detection, all options available, switch-to-real-and-back, separate Re/Im mode, non-complex shows Real only. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Axis inversion: Transpose z data in pyqtgraph setImage() to match matplotlib convention. The meshgrid first axis maps to the bottom (x) label but pyqtgraph ImageItem needs it transposed for correct display orientation. Deselect all: Remove the button since pyqtgraph's flowchart does not propagate empty selection downstream (by design — None return from process() means 'no change'). The Select All / 1D / 2D buttons already provide sufficient selection control. deselectAll() now selects the first dependent to ensure the plot always has data. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Renamed deselectAll to selectFirst — selects only the first dependent, matching the default behaviour when opening a plot window. Added as a visible button in the data selector toolbar. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…optimization # Conflicts: # plottr/apps/inspectr.py # plottr/data/datadict.py # plottr/node/autonode.py # plottr/node/scaleunits.py # plottr/plot/mpl/autoplot.py # test_requirements.txt

Master cleaned up inspectr type:ignore comments, so the per-module override is no longer needed for that module. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

CI installs PyQt5, not PyQt6. Use plottr's Qt abstraction layer (plottr.QtCore, plottr.QtWidgets) for cross-binding compatibility. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Mikhail Astafev and others added 17 commits April 15, 2026 13:51

astafan8 force-pushed the perf/datadict-copy-optimization branch from a9843a4 to 4263563 Compare April 20, 2026 12:21

Mikhail Astafev and others added 5 commits April 20, 2026 14:27

fix: reduce matplotlib scrollable min height to 100px per row

c663ec7

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

docs: reorganize PERFORMANCE_PLAN.md into implemented vs future

f948fb6

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

astafan8 changed the title ~~perf: optimize DataDict copy, validate, and pipeline data flow~~ perf+feat: pipeline optimization, inspectr fast loading, plot UI improvements Apr 20, 2026

Mikhail Astafev and others added 6 commits April 20, 2026 16:01

astafan8 marked this pull request as ready for review April 21, 2026 07:06

Mikhail Astafev and others added 4 commits April 21, 2026 09:41

jenshnielsen requested a review from Copilot April 21, 2026 11:19

Copilot started reviewing on behalf of jenshnielsen April 21, 2026 11:19 View session

Copilot AI reviewed Apr 21, 2026

View reviewed changes

Mikhail Astafev and others added 11 commits April 21, 2026 13:46

chore: remove unused imports from test_regressions

d382ec1

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

astafan8 force-pushed the perf/datadict-copy-optimization branch from 2f6ab67 to d6fb45d Compare May 1, 2026 13:31

Mikhail Astafev and others added 10 commits May 1, 2026 15:41

ci: trigger CI after history rewrite

2efa69e

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Merge remote-tracking branch 'origin/master' into perf/datadict-copy-…

e6a31e1

…optimization # Conflicts: # plottr/apps/inspectr.py # plottr/data/datadict.py # plottr/node/autonode.py # plottr/node/scaleunits.py # plottr/plot/mpl/autoplot.py # test_requirements.txt

chore: remove inspectr from mypy warn_unused_ignores override

8ec0349

Master cleaned up inspectr type:ignore comments, so the per-module override is no longer needed for that module. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

fix: use plottr Qt imports instead of PyQt6 in tests

15f7b3a

CI installs PyQt5, not PyQt6. Use plottr's Qt abstraction layer (plottr.QtCore, plottr.QtWidgets) for cross-binding compatibility. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

fix: ParamSpecBase import compat for older qcodes in CI

99122f9

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf+feat: pipeline optimization, inspectr fast loading, plot UI improvements#441

perf+feat: pipeline optimization, inspectr fast loading, plot UI improvements#441
astafan8 wants to merge 53 commits into
masterfrom
perf/datadict-copy-optimization

astafan8 commented Apr 15, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

astafan8 commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Performance: DataDict pipeline (core)

Performance: Inspectr database loading

Performance: Plot backends

Fixes: Pyqtgraph backend

Fixes: Matplotlib backend

UI: Data selector buttons

UI: Plot layout and controls

UI: LaTeX labels in pyqtgraph

Code quality and typing

Tests (new)

Documentation

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

astafan8 commented Apr 15, 2026 •

edited

Loading